Regression for citation data: An evaluation of different methods
Abstract
Citations are increasingly used for research evaluations. It is therefore important to identify factors affecting citation scores that are unrelated to scholarly quality or usefulness so that these can be taken into account. Regression is the most powerful statistical technique to identify these factors and hence it is important to identify the best regression strategy for citation data. Citation counts tend to follow a discrete lognormal distribution and, in the absence of alternatives, have been investigated with negative binomial regression. Using simulated discrete lognormal data (continuous lognormal data rounded to the nearest integer) this article shows that a better strategy is to add one to the citations, take their log and then use the general linear (ordinary least squares) model for regression (e.g., multiple linear regression, ANOVA), or to use the generalized linear model without the log. Reasonable results can also be obtained if all the zero citations are discarded, the log is taken of the remaining citation counts and then the general linear model is used, or if the generalized linear model is used with the continuous lognormal distribution. Similar approaches are recommended for altmetric data, if it proves to be lognormally distributed.
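The strategies named in the abstract can be illustrated with a minimal sketch. It assumes NumPy only, and every specific in it (a single binary factor `x`, intercept 1.0, slope 0.5, sigma 1.0, sample size 5000, and the helper names) is a hypothetical choice made for illustration, not taken from the article. It simulates discrete lognormal counts by rounding continuous lognormal draws to the nearest integer, then fits ordinary least squares to ln(1 + citations) and, as the second option, to the log of the non-zero counts only.

```python
import numpy as np


def simulate_discrete_lognormal(x, intercept, slope, sigma, rng):
    """Draw continuous lognormal values and round them to the nearest integer."""
    mu = intercept + slope * x  # mean of the underlying normal, per observation
    continuous = rng.lognormal(mean=mu, sigma=sigma)
    return np.rint(continuous).astype(int)


def ols_fit(x, y):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    design = np.column_stack([np.ones_like(x, dtype=float), x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[0], coef[1]


rng = np.random.default_rng(0)
n = 5000
x = rng.integers(0, 2, size=n).astype(float)  # hypothetical binary factor
citations = simulate_discrete_lognormal(
    x, intercept=1.0, slope=0.5, sigma=1.0, rng=rng
)

# Strategy 1: add one to the citations, take the log, then use OLS.
b0_plus1, b1_plus1 = ols_fit(x, np.log(citations + 1))

# Strategy 2: discard zero citations, take the log of the rest, then use OLS.
nonzero = citations > 0
b0_nz, b1_nz = ols_fit(x[nonzero], np.log(citations[nonzero]))

print("ln(1 + c) slope:", b1_plus1)
print("ln(c), zeros dropped, slope:", b1_nz)
```

Both slopes are estimates on a transformed scale, so neither is expected to reproduce the simulation slope exactly. The sketch only shows the mechanics of the two transformations and does not reproduce the article's comparison with negative binomial regression.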
Similar resources
Self-citation in the mirror of ethics
Self-citation is a behavior that is seen to varying degrees in researchers, research centers and medical journals. The question is whether self-citation is moral or not. This is a descriptive and analytical study (library and document research). Two main keywords (self-citation and ethics) were used for searching databases. In addition, efforts have been made for moral evaluation of self-citat...
The online attention to certain nuclear medicine topics: An altmetrics study vs. a citation analysis
Introduction: Traditional citation analysis has been greatly criticized because the process of citation accumulation requires considerable time after publication. So, the term “altmetrics” was proposed in 2010 to measure the scientific and social impact of a paper. We performed a search for certain nuclear medicine topics using the altmetrics approach to report the correlation b...
Coronavirus: Scientometrics of 50 Years of Global Scientific Productions
Background: Scientometrics studies are one of the most efficient methods of quantitative evaluation of the scientific outputs of valuable information and citation databases for understanding and observing the status of scientific publications in different subject areas. The main aim of this article was to study the 50 years of Coronavirus scientific publications in the world. Materials & Meth...
The analysis of co-citation and word co-occurrence networks of Iranian articles in the field of dentistry
Background and Aims: Dentistry is an important profession ensuring the health of body and soul, and has a special place in the scientific productions of medical disciplines. The purpose of this study was to analyze the co-citation and word co-occurrence of Iranian research papers in the field of dentistry based on indexed documents in Web of Science from 2014 to 2018. Materials and Methods:...
Content and citation analysis of articles in the scientific research quarterly Payavard Salamat
Introduction: Citation and content analysis are among the most common methods for evaluating scientific journals. The aim of this study is to analyze the content and citations of the Payavard Salamat journal. Methods: This is a descriptive, cross-sectional study. The collection tool was an author-made checklist. The research population included all 164 published articles in Payavard Salamat jour...
Journal title:
- J. Informetrics
Volume 8, Issue
Pages -
Publication date: 2014